Asymptotically consistent estimation of the number of change points in highly dependent time series
نویسندگان
چکیده
The problem of change point estimation is considered in a general framework where the data are generated by arbitrary unknown stationary ergodic process distributions. This means that the data may have long-range dependencies of an arbitrary form. In this context the consistent estimation of the number of change points is provably impossible. A formulation is proposed which overcomes this obstacle: it is possible to find the correct number of change points at the expense of introducing the additional constraint that the correct number of process distributions that generate the data is provided. This additional parameter has a natural interpretation in many real-world applications. It turns out that in this formulation change point estimation can be reduced to time series clustering. Based on this reduction, an algorithm is proposed that finds the number of change points and locates the changes. This algorithm is shown to be asymptotically consistent. The theoretical results are complemented with empirical evaluations.
منابع مشابه
A consistent clustering-based approach to estimating the number of change-points in highly dependent time-series
The problem of change-point estimation is considered under a general framework where the data are generated by unknown stationary ergodic process distributions. In this context, the consistent estimation of the number of change-points is provably impossible. However, it is shown that a consistent clustering method may be used to estimate the number of change points, under the additional constra...
متن کاملLocating Changes in Highly Dependent Data with Unknown Number of Change Points
The problem of multiple change point estimation is considered for sequences with unknown number of change points. A consistency framework is suggested that is suitable for highly dependent time-series, and an asymptotically consistent algorithm is proposed. In order for the consistency to be established the only assumption required is that the data is generated by stationary ergodic time-series...
متن کاملTwo-stage Procedure in P-Order Autoregressive Process
In this paper, the two-stage procedure is considered for autoregressive parameters estimation in the p-order autoregressive model ( AR(p)). The point estimation and fixed-size confidence ellipsoids construction are investigated which are based on least-squares estimators. Performance criteria are shown including asymptotically risk efficient, asymptotically efficient, and asymptotically consist...
متن کاملNonparametric Multiple Change Point Estimation in Highly Dependent Time Series
Given a heterogeneous time-series sample, it is required to find the points in time (called change points) where the probability distribution generating the data has changed. The data is assumed to have been generated by arbitrary, unknown, stationary ergodic distributions. No modelling, independence or mixing assumptions are made. A novel, computationally efficient, nonparametric method is pro...
متن کاملBayesian Estimation of the Multiple Change Points in Gamma Process Using X-bar chart
The process personnel always seek the opportunity to improve the processes. One of the essential steps for process improvement is to quickly recognize the starting time or the change point of a process disturbance. Different from the traditional normally distributed assumption for a process, this study considers a process which follows a gamma process. In addition, we consider the possibility o...
متن کامل